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X Abstract 



In this paper, a few novel data hiding techniques are proposed. These tech- 
niques are improvements over the classical LSB data hiding technique and the 
Fibonacci LSB data-hiding technique proposed by Battisti et al. [Tj. The clas- 
sical LSB technique is the simplest, but using this technique it is possible to 
embed only in first few bit-planes, since image quality becomes drastically dis- 
torted when embedding in higher bit-planes. Battisti et al. [T] proposed an im- 
provement over this by using Fibonacci decomposition technique and generating 
a different set of virtual bit-planes all together, thereby increasing the number 
of bit-planes. In this paper, first we mathematically model and generalize this 
particular approach of virtual bit-plane generation. Then we propose two novel 
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embedding techniques, both of which are special-cases of our generahzed modeL 
The first embedding scheme is based on decomposition of a number (pixel- value) 
in sum of prime numbers, while the second one is based on decomposition in 
sum of natural numbers. Each of these particular representations generates a 
different set of (virtual) bit-planes altogether, suitable for embedding purposes. 
They not only allow one to embed secret message in higher bit-planes but also do 
it without much distortion, with a much better stcgo-image quality, in a reliable 
and secured manner, guaranteeing efficient retrieval of secret message. A com- 
parative performance study between the classical Least Significant Bit (LSB) 
method, the data hiding technique using Fibonacci -p-Sequence decomposition 
and our proposed schemes has been done. Theoretical analysis indicates that 
image quality of the stego-image hidden by the technique using Fibonacci de- 
composition improves against simple LSB substitution method, while the same 
using the prime decomposition method improves drastically against that using 
Fibonacci decomposition technique, and finally the natural number decomposi- 
tion method is a further improvement against that using prime decomposition 
technique. Also, optimality for the last technique is proved. For both of our 
data-hiding techniques, the experimental results show that, the stego-image is 
visually indistinguishable from the original cover image. 

Keywords 

Data hiding. Information Security, LSB, Fibonacci, Image Quality, Chebysev In- 
equality, Prime Number Theorem, Sieve of Eratosthenes, Goldbach Conjecture, 
Pigeon-hole Principle, Newton-Raphson method. 

1 Introduction 

Data hiding technique is a new kind of secret communication technology. It has 

been a hot research topic in recent years, and it is mainly used to convey mes- 
sages secretly by concealing the presence of communication. While cryptography 
scrambles the message so that it cannot be understood, steganography hides the 
data so that it cannot be observed. The main objectives of the steganographic 
algorithms are to provide confidentiality, data integrity and authentication. 

Most steganographic techniques proceed in such a way that the data which 
has to be hidden inside an image or any other medium like audio, video etc., is 
broken down into smaller pieces and they are inserted into appropriate locations 
in the medium in order to hide them. The aim is to make them un-perceivable 
and to leave no doubts in minds of the hackers who 'step into' media-files to 
uncover 'useful' information from them. To achieve this goal the critical data has 
to be hidden in such a way that there is no major difference between the original 
image and the 'corrupted' image. Only the authorized person knows about 
the presence of data. The algorithms can make use of the various properties 
of the image to embed the data without causing easily detectable changes in 
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them. Data embedding or water marking algorithms ([5], [5], [7], [S], [H], [2U] ) 

necessarily have to guarantee the following: 

• Presence of embedded data is not visible. 

• Ordinary users of the document /image are not affected by the watermark, 
i.e., a normal user does not see any ambiguity in the clarity of the docu- 
ment/image. 

• The watermark can be made visible/retrievable by the creator (and pos- 
sibly the authorized recipients) when needed; this implies that only the 
creator has the mechanism to capture the data embedded inside the doc- 
ument/image. 

• The watermark is difficult for the other eavesdropper to comprehend and 
to extract them from the channels. 

In this paper, we mainly discuss about using some new decomposition meth- 
ods in a classical Image Domain Technique, namely LSB technique (Least Sig- 
nificant Bit coding, ([H], [in]))j in order to make the technique more secure 
and hence less predictable. We basically generate an entirely new set of bit 
planes and embed data bit in these bit planes, using our novel decomposition 
techniques. 

For convenience of description, here, the LSB is called the 0*'' bit, the second 
LSB is called the 1** bit, and so on. We call the newly-generated set of bit-planes 
'virtual', since we do not get these bit-planes in classical binary decomposition 
of pixels. 

Rest of the paper is organized as follows: Sections 2 and 3 describes the 
embedding technique in classical LSB and Fibonacci decomposition technique 
with our modification. Section 4 describes a generalized approach that we follow 
in our novel data-hiding techniques using prime/natural number decomposition. 
Section 5 describes the embedding technique using the prime decomposition, 
while the experimental results obtained using this technique are reported in 
Section 6. In Section 7, we describe the other embedding technique, i.e., the one 
using the natural number decomposition, and the experimental results obtained 
using this technique are reported in Section 8. Finally, in Section 9 we draw our 
conclusions. 

2 The Classical LSB Technique - Data Hiding 
by Simple LSB Substitution 

Among many different data hiding techniques proposed to embed secret message 
within images, the LSB data hiding technique is one of the simplest methods 
for inserting data into digital signals in noise free environments, which merely 
embeds secret message-bits in a subset of the LSB planes of the image. Prob- 
ability of changing an LSB in one pixel is not going to affect the probability 
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of changing the LSB of the adjacent or any other pixel in the image. Data 
hiding tools, such as Steganos, StegoDos, HideBSeek etc are based on the LSB 
replacement in the spatial domain [2] . But the LSB technique has the following 
major disadvantages: 

• It is more predictable and hence less secure, since there is an obvious 
statistical difference between the modified and unmodified part of the 
stego-image. 

• Also, as soon as we go from LSB to MSB for selection of bit-planes for 
our message embedding, the distortion in stego-image is likely to increase 
exponentially, so it becomes impossible (without noticeable distortion and 
with exponentially increasing distance from cover-image and stego-image) 
to use higher bit-planes for embedding without any further processing. 

The workarounds may be: Through the random LSB replacement (in stead 
of sequential), secret messages can be randomly scattered in stego- images, so 
the security can be improved. 

Also, using the approaches given by variable depth LSB algorithm (Chen et 
al- lU); or by the optimal substitution process based on genetic algorithm and 
local pixel adjustment (Wang et al. |4j), one is able to hide data to some extent 
in higher bit-planes as well. 

We propose two novel new data-hiding schemes by increasing the available 
number of bit-planes using new decomposition techniques. Similar approach 
was given using Fibonacci-p-sequence decomposition technique by Battisti et 
al-([l], [12]), but we show the proposed decomposition techniques to be more 
efficient in terms of generating more virtual bit-planes and maintaining higher 
quality of stego-image after embedding. 

3 Generalized Fibonacci LSB Data Hiding Tech- 
nique 

This particular technique, proposed by Battisti et al. pi, investigates a different 
bit-planes decomposition, based on the Fibonacci-p-sequences, given by, 

^;,(o) = ^;,(i) = . . . = Fp{p) = 1 

i^p(n)=Fp(n-l)+Fp(n-p-l), Vn>p+1, n,peH (1) 

This technique basically uses Fibonacci-p-sequence decomposition, rather 
than classical binary decomposition (LSB technique) to obtain different set of 
bit-planes, embed a secret message-bit into a pixel if it passes the Zeckendorf 
condition, then while extraction, follow the reverse procedure. 

We shall slightly modify the above technique, but before that let us first 
generalize our approach, put forward a mathematical model and then propose 
our new data-hiding techniques as special-cases of the generalized model. 

For the proposed data hiding techniques our aim will be 
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• To expand the set of bit-planes and obtain a new different set of virtual 

bit-planes. 

• To embed secret message in higher bit-planes of the cover-image as well, 
maintaining high image quality, i.e., without much distortion. 

• To extract the secret message from the embedded cover-image efficiently 
and without error. 

4 A Generalized LSB Data Hiding Technique 

If we have k-bit cover image, there are only k available bit-planes where secret 
data can be embedded. Hence we try to find a function / that increases the 
number of bit-planes from k to n, n > fc, by converting the k-bit 8-4-2-1 standard 
binary pixel representation to some other binary number system with different 
weights. We also have to ensure less distortion in stego-image with increasing bit 
plane. As is obvious, in case of classical binary decomposition, the mapping / is 
identity mapping. But, our job is to find a non-identity mapping that satisfies 
our end. Figure-1 presents our generalized model, while Figure-2 explains the 
process of embedding. 

4.1 The Number System 

We define a number system by defining two things: 

• Base (radix) r (digits of the number system e {0, . . . , r — 1}) 

• Weight function W{.), where W{i) denotes the weight corresponding to 
i*'' digit (e.g., for 8-4-2-1 binary system, W{0) = 1, W{1) = 2, W{2) = 4, 

T4^(4) = 8). 

Hence, the pair {r,W{.)), defines a number system completely. Obviously, 
our decimal system can be denoted in this notation as (10, lO^'^). 

A number having representation dk-idk-2 ■ ■ ■ dido in number system (r, W^(.)) 
will have the following value (in decimal), D = J2iZo di.W{i), di G {0, 1, . . . , i — 
1}. This number system may have some redundancy if 3 more than one rep- 
resentation for the same value, e.g., the same (decimal) value D may be rep- 
resented as dk-idk-2 ■ ■ ■ dido and d'i._id'i^_2 ■ ■ ■ d'ld'o, i.e., D = Yli=Q di.W{i) = 
YliZo di.W{i), where d^,d'^ e {0, 1, . . . , r - 1}. Here d^ ^ d[ for at least 2 
different i s. 

To eliminate this redundancy and to ensure uniqueness, we should be able 
to represent one number uniquely in our number system. To achieve this, we 
must develop some technique, so that for number(s) having multiple (more 
than one, non-unique) representation in our number system, we can discard 
all representations but one. One way of doing this may be: from the multiple 
representations choose the one that has lexicographical highest (or lowest) value, 
discard all others. We shall use this shortly in case of our prime number system. 
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Figure 1: Basic block-diagram for generalized data-hiding technique 
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Figure 2: Basic block-diagram for embedding secret data-bit 



As shown in Figure-2, for classical binary number system (8-4-2-1), we use 
the weight function W{.) defined by, W{.) = 2( ) ^ W : i 2^ ^ Wii) = 2\ 
V? G Z+[j{0}, corresponding to i*^ bit-plane (LSB = 0*'' bit), so that a li- 
bit number (k-bit pixel-value) pk is represented as pk ~ 12'i=o ^ic-'^^: where 
bic € {0; 1} " this is our well-known binary decomposition. 

Now, our / converts this pk to some virtual pixel representation (in a 
different binary number system) with n (virtual) bit-planes, obviously we need 
to have n > k to expand number of bit planes. But finding such / is equiv- 
alent to finding a new weight function W{.), so that W{i) denotes the weight 
of i*'* (virtual) bit plane in our new binary number system, Vi G Z'^ [J {0}. 
Mathematically, p[^ — J27=o Kc i"^) ' where bic G {0,1} - this is our new 
decomposition, with the obvious condition that (Pfe)(2,2( )) — iPn)(2,w{.)) 

Also, W{i) must have less abrupt changes with respect to i, (i*'' bit plane, 
virtual), than that in the case of 2' , in order to have less distortion while 
embedding data in higher (virtual) bit planes. We call these expanded set of bit 
planes as virtual bit planes, since these were not available in the original cover 
image pixel data. 

But, at the same time we must ensure the fact that the function / that we 
use must be injective, i.e., invertible, unless otherwise we shall not be able to 
extract the embedded message precisely. 
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4.2 The Number System using Fibonacci p-Sequence De- 
composition 

Function / proposed by Battisti et al.[Tj converts the pixel in binary decomposi- 
tion to pixel in Fibonacci decomposition using generalized Fibonacci p-sequence, 
where corresponding weights are Fp{n), Vn e H, i.e., W{.) — Fibp{.), i.e., the 
number system proposed by them to model virtual bitplanes is (2,fp(.)). 

Since this number system too has redundancy (we can easily see it by ap- 
plying pigeon-hole principle), for uniqueness and to make the transformation 
invertible, Zeckendorf's theorem, has been used. 

4.2.1 Modification to ensure uniqueness 

Instead of Zeckendorf's theorem, we use our lexicographically higher prop- 
erty. Hence, if a number has more than one representation using Fibonacci 
p-sequence decomposition, only the one lexicographically highest will be valid. 
Using this technique we prevent some redundancy also, since numbers in the 
range [0,Y^^^fl Fp(i)] can be represented using n-bit Fibonacci-p-sequence de- 
composition. For an 8-bit image, the set of all possible pixel-values in the range 
[0,255] has the corresponding classical Fibonacci {p—1, Fibonacci-l-sequence, 
Fibonacci series ([TOj, [H], [13) ) decomposition as shown in Table- 1. One may 
use this map to have a constant-time Fibonacci decomposition from pixel values 
into 12 virtual bit-planes. 

5 Proposed approach 1 : The Prime Decompo- 
sition Technique 

5.1 The Prime Number System and Prime Decomposition 

We define a new number system, and as before we denote it as (2,P(.)), where 
the weight function P{.) is defined as, 

P(0) = 1, 

Pii) = p„ yi e z+ , (2) 

Pi — i*'' Prime, 
pi = 2,p2 = 3,p3 = 5, . . . 

Po = 1 

Since the weight function here is composed of prime numbers, we name 
this number system as prime number system and the decomposition as prime 
decomposition. 

As we have discussed earlier, if a number has more than one representation 
in our number system, we always choose the lexicographically highest of them 
as valid, e.g., '3' has two different representations in 3-bit prime number system, 
namely, 100 and Oil, since we have. 
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Table 1: Fibonacci (1-sequence) decomposition for 8-bit image yielding 12 vir- 
tual bit-planes g 



l.P(2) + O.P(l) + O.P(O) = l.p2 + O.pl + 0.1 = 1.3 + 0.2 + 0.1 = 3 
0.P(2) + l.P(l) + l.P(O) = 0.p2 + l.pl + 1.1 = 0.3 + 1.2 + 1.1 = 3 



100 being lexicographically (from left to right) higher than Oil, we choose 
100 to be valid representation for 3 in our prime number system and hence 
discard Oil, which is no longer a valid representation in our number system. 

3 = rnax,_^^„^,^^(100, 011)^100. 

Hence, for our 3-bit example, the valid representations are: 000 O 0, 001 -O- 
1,010 <H- 2,100 <H- 3,101 <H- 4,110 <H- 5,111 <H- 6. Numbers in the range [0,6] 
can be decomposed using our 3-bit prime number system uniquely, with only 
the representation Oil avoided. 

Now, let us proceed with this very simplified example to see how the secret 
data bit is going to be embedded. Wc shall embed a secret data bit into a 
(virtual) bit-plane by just simply replacing the corresponding bit by our data 
bit, if we find that after embedding, the resulting representation is a valid rep- 
resentation in our number system, otherwise wc do not embed, just skip. This 
is only to guarantee the existence of the inverse function and proper extraction 
of our secret embedded message bit. 

Again, let us elucidate by our previous 3-bit example. Let the 3-bit pixel 
within which we want to embed secret data be of value 2, use prime decompo- 
sition to get 010, and we want to embed in the LSB bit-plane, let our secret 
message bit to be embedded be 1. So, we just replace the pixel LSB by data 
bit 1 and immediately see that after embedding the pixel, it will become Oil, 
which is not a valid representation, hence we skip this pixel without embedding 
our secret data bit. 

Had we used this pixel value for embedding and after embedding ended up 
with pixel value Oil (value 3), we might get erroneous result while extraction 
of the secret bit. Because during extraction decomposition of embedded pixel 
value 3 would wrongly give 100 instead of Oil, and extraction of LSB virtual 
bit-plane would wrongly give the embedded bit as instead of its true value 1. 
Figure-3 explains this error pictorially. 

Hence, embed secret data bit only to those pixels, where after embedding, 
we get a valid representation in the number system. 

5.2 Embedding algorithm 

• First we find the set of all prime numbers that are required to decompose a 
pixel value in a k-bit cover-image, i.e., we need to find a number n € K such 
that all possible pixel values in the range [0, 2*^ — 1] can be represented 
using first n primes in our n-bit prime number system, so that we get 
n virtual bit-planes after decomposition. We can use Sieve method, for 
example, to find primes. (To find the n is quite easy, since we sec, using 
Goldbach conjecture etc, that all pixel-values in the range [0, X]™o^P»] 
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Figure 3: Error in not guaranteeing uniqueness of transformation 



can be represented in our m-bit prime number system, so all we need to 
do is to find an n such that ^Y^Zq Vi > 2*^ — 1, since the highest number 
that can be represented in n-bit prime number system is 'Y^=^ Vi- 

• After finding the primes, we create a map of k-bit (classical binary de- 
composition) to n-bit numbers (prime decomposition), n > k, marking all 
the valid representations (as discussed in previous section) in our prime 
number system. For an 8-bit image the set of all possible pixel-values in 
the range [0,255] has the corresponding prime decomposition as shown in 
Table-2, As one may notice, the size of the map to be stored has been 
increased in this case, indicating a slightly greater space complexity, 

• Next, for each pixel of the cover image, we choose a (virtual) bit plane, 
say p*'* bit-plane and embed the secret data bit into that particular bit 
plane, by replacing the corresponding bit by the data bit, if and only if 
we find that after embedding the data bit, the resulting sequence is a 
valid representation in n-bit prime number system, i,e., exists in the map 
otherwise discard that particular pixel for data hiding, 

• After embedding the secret message bit, we convert the resultant sequence 
in prime number system back to its value (in classical 8-4-2-1 binary 
number system) and we get our stego-image. This reverse conversion is 
easy, since we need to calculate J2i=Q ^i-Pi only, where bi € {0,1}, Vi G 
{0,n-l} 

5.3 Extraction algorithm 

The extraction algorithm is exactly the reverse. From the stego-image, we 
convert each pixel with embedded data bit to its corresponding prime decom- 
position and from the p*'' bit-plane extract the secret message bit. Combine 
all the bits to get the secret message. Since, for efficient implementation, we 
shall have a hash-map for this conversion, the bit extraction is constant-time, 
so the secret message extraction will be polynomial (linear) in the length of the 
message embedded. 
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26 
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218 
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27 
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219 


1 1 1 111001000010 


28 
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220 


1 1 1 111001000100 
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110000010000001 
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39 
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110000100000000 
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111100001000010 
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111111100010001 


40 


001000000000100 
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110000100000001 
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111100001000100 
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111111100010010 


41 
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42 
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234 
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43 
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110001000000000 
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111100100000000 
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44 
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110001000000001 
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45 
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46 
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111101000000000 
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48 
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49 


100000000001001 
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111101000000010 
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111111110000001 


50 
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110010000000001 
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242 


111111110000010 


51 


100000000010001 


115 


110100000000000 
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111101000000101 


243 


111111110000100 


52 


100000000010010 


116 


110100000000001 


180 


111101000001000 


244 


111111110000101 


53 


100000000010100 


117 


110100000000010 
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111110000000000 


245 


111111110001000 


54 
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111111110001001 




lODonoonoioonoi 


119 


llDlOOOOOOODini 


1«3 


iiiiioooonnooin 




111111110011)000 


56 


l()()OlJintilil)l)OOini 




ilOlOlJintDDOlOOn 


1 •< 1 


1 i i 1 HjlJIHtDDlllOl) 


2 1-^ 


1 11 i 1 1 1 IIHUDDOI 


57 


lOOOOOODlOOOOOi 


121 


lliOOOOODDOOOOl) 




illiiOOOODDOiOl 


2 19 
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58 
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250 


111111110010100 


59 
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123 


111000000000010 


187 


111110000001001 


251 


111111110100000 


60 


100000010000000 


124 


111000000000100 


188 


111110000010000 


252 


111111110100001 


61 


100000010000001 


125 


111000000000101 


189 


111110000010001 


253 


111111111000000 


62 


100000100000000 


126 


111000000001000 


190 


111110000010010 


254 


111111111000001 


63 


100000100000001 


127 


111000000001001 


191 


111110000010100 


255 


111111111000010 



Table 2: Prime decomposition for 8-bit image yielding 15 virtual bit-planes 
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5.4 The performance analysis : Comparison between clas- 
sical Binary, Fibonacci and Prime Decomposition 

In this section, we do a comparative study between the different decompositions 
and its effect upon higher-bit-plane data-hiding. We basically try to prove our 
following two claims, by means of the following theorems from Number Theory 

m- 

5.4.1 The Prime Number Theorem : A Polynomial tight bound for 
Primes 

By Tchebychef theorem, 0.92 < ""^"^^J"^"^^ < 1.105, Va; > 2, where tt{x) denotes 
number of primes not exceeding x, i.e., 7t{x) = 9 (j^)- This leads to famous 

Prime Number theorem lim„_j.oo ^ (^njin{n)) ) ^ From this one can show [1] 
that, if p„ be the n*'' prime, 3LI, L2 e 3?, such that LI < (^ (^nh"(7i)) ^ ^ - 
2, n e Z+, i.e., lim„^^ ( („!„'(„)) ) = 1- 

Pn - 9{n. ln(n)) (3) 

5.4.2 A lower bound for the Fibonacci-p-Sequence 

The Fibonacci-p-sequence, for p > 1, p e H, is given by, 

Fp{0) = Fp{l) = . . . = Epip) = I, 
Fp{n) = Fp{n - 1) + Fp{n - p - 1), Vn > p + I, n e ^ 

We prove the following lemmas and find 

Lemma- 1: If the ratio of two consecutive numbers in Fibonacci p-sequence 
converges to limit ap G 3?"*", ap satisfies the equation xP~^^ — cc'' — 1 = 0, Vp € H. 

Proof: 

T / fn+p \ 1. / /n+p— 1 \ / fn \ 

ap = Imi = hm — = . . . = hm = ■ ■ ■ , 

n^oo \Jn+p-lJ "^oo V /" / \Jn-lJ 

fn — n*'^ number in the Fibonacci — p Sequence, fn+p — fn+p-i + fn-i 
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^ - - 1 = 



Lemma-2: If ap be a +ve root of the equation x^^^ — a;^ — 1 = 0, we have 
1 < Qfp < 2, Vp e H. 

Proof: We have, 

aP+^ -aP-l = Q also, 2^+^ - 2^ - 1 = 2" - 1 > 0, Vp G Z+ 

^ 2P - 1 > a^+i - aP - 1 ^ (2f - a^) > - 2) (4) 

Also, 

- 1 < = - - 1 a^(ap - 1) > => ofp > 1 (since positive) (5) 
From Q, we immediately see the following: 

• Qfp > according to our assumption, hence we can not have ap = 2 (LHS 
& RHS both becomes 0, that does not satisfy inequality (|4|) 



• If ftp > 2, we have LHS < while RHS > which again does not satisfy 
inequality Q. 

• Hence we have ap < 2, Vp G H 

From ([5|, we have, ap > 1. Combining, we get, 1 < ap < 2, Vp € H 

Lemma-3: If ap be a +ve root of the equation — xP — 1 — Q, where p € H, 
we have, 

. afe+i > ^ 

• al< (fc + 1), Vfc e H 

Proof: We have. 

For p = k, al+^ - a^ - 1 = 
For p = fc + 1, al+l - al+\ -1 = 
=^a^+J(afc+i-l)=at(afe-l) 



k + l 

From ^ we can argue, 

• afc 7^ Q^fe+i, since neither of them is or 1 (from lemma-2). 
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• If afe < ak+i, we have LHS of inequality ([6]) < 1, but RHS > 1, since 
both the terms in RHS will be greater than 1 (by our assumption and by 
lemma-2), a contradiction. 

• Hence, we must have 

Uk > ttfe+i, V/c G H (7) 

Again, from ([6| we have, 

^ i \ ^^^^ > gi^^g (—] > 1, from m 

2 > Qffc+i > I -'^^^^ ^ I ^ (from lemma-2) 

V"fc+i - 1/ 

^ > (8) 

Now, let us induct on p to prove < p + 1. 

Base case: for p = 1, ai < 2 , by lemma-2 
Let us assume the inequality holds < k ^ oi^ < p + l^p < k 

Induction Step: for p = fc + 1, al+\ = al. (^ ^'^ , by ^ 

ttfe+i < {k + 1). i 5— ) , by induction hypothesis 

=^ ttfc+i < (fc + 1). 1 + — 



ttfc+i < (fc + 1) + 1, ( from (§, wc havc,-^ ^ < 1 



^ a^+} < (fc + 2) 
a^< (p+l),VpeH (9) 



Lemma-4 The following inequalities always hold: 

• (fc + 1)^ < fc^ < . . . < 43 < 35 < 2 

• aP <p+l^ aP^'^ <p^...al<4^al<3^ap<2 



Proof: By Binomial Theorem, we have, 

''"^ (fc - 1)! ''"^ 



k-l 



n=0 ^ ' 11=1 r=l 

< (1 + 1 + 1 + .. + 1) .k''-^ = fc.fc'^-1 ^ k''^ {k + 1)T^ < fc^ (10) 



k times 
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Hence, we have, (fc + 1)^ < /c^-i < . . . < 4s < 35 < 2 
Also, from ^ we have, at < (fc + 1)^. 
Combining, we get, 

Q!fc < (fc + 1)^ < fc^ < . . . < 43 < 3^ < 2 
Qffe < (fc + 1) ^ a^"^ <k...^al<5^al<4:^al<3^ak<2 (11) 

Lemma-5 The following inequality gives us the lower bound, 

Fp{n) > a^-P, Vn > p, n G H (12) 
where ap is the +ve root of the equation a;^+^ — — 1 = 0. 

Proof: We induct on n to show the result. 

Fp{0) = Fp{l) = . . . = Fp{p) = 1, (By definition of Fibonacci-p-Sequence) . 
Base case : 

n=p+l, Fp{p + 1) = Fp{p) + Fp{0) = 1 + 1 = 2 > ftp, (From Lemma-4) 
n=p + 2, Fp{p + 2) = Fp(p + 1) + Fp{l) = 2 + 1 = 3 > Q^p^ (From Lemma-4) 
n=p + i, Fp{p + 3) = Fp(p + 2) + Fp{2) = 3 + 1 = 4 > ap^, (From Lemma-4) 

n = p + (p -fl), Fp{p + p+ 1) = Fp{p + p)+ Fp{p) = (p -M) -I- 1 

= p -f- 2 > apP^^, (From Lemma-4) 



Induction Step: 

Let's assume the above result is true Vm < n, m, n e H, for to > 2p + 1 as 
well. Then we have, 

Fpin) = Fp{n - 1) + Fp{n - p - 1) > 0;^-^"' + a^-^P-^ (hypothesis) 
^ Fp(n) > a;-^P-\{l + aPp) = a^-^P-i.aP+i = a^-P 
^ i^p(n) > a^'P, \/n> p, n € H 



Hence, we have the following inequality, 

Fp{n) > [apT-P 
ap e 3?+ 

ai = ^-^^ « 1.618034 
a2 w 1.465575 
as ~ 1.380278 
a4 « 1.324718 

Up > Ckp+i, Vp G 
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267914296 
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433494437 
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86267571272 


73682389315.076 


139583862445 


119220644109.601 


225851433717 


192903109060.823 


365435296162 


312123875552.315 


591286729879 


505027182631.253 


956722026041 


817151378583.699 


1548008755920 


1322179079633.401 


2504730781961 


2139331297036.010 


4052739537881 


3461511733907.302 


6557470319842 


5600845227000.975 


10610209857723 


9062360514205.225 


17167680177565 


14663211490563.064 


27777890035288 


23725581307425.750 



Table 3: ai is a +ve Root oi — x — 1 — i.e., ai w 1.618034 



The sequence Up is decreasing in p. 

The empirical results illustrated in Tables 3 and 4 also depict the same: 
5.4.3 Measures 

As we know, Security, embedding distortion and embedding rate can be used as 
schemes to evaluate the performance of the data hiding schemes. The following 
are the popular parameters, 

• Entropy - A steganographic system is perfectly secure when the statistics 
of the cover-data and stego-data are identical, which means that the rel- 
ative entropy between the cover data and the stego-data is zero. Entropy 
considers the information to be modeled as a probabilistic process that can 
be measured in a manner that agrees with intuition ^38j.The information 
theoretic approach to steganography holds capacity of the system to be 
modeled as the ability to transfer information ([H], [53], [37]). 

• Mean Squared Error and SNR - The (weighted) mean squared error be- 
tween the cover image and the stego-image (embedding distortion) can 
be used as one of the measures to assess the relative perceptibility of the 
embedded text. Imperceptibility takes advantage of human psycho visual 
redundancy, which is very difficult to quantify. Mean square error (MSE) 
and Peak Signal to Noise Ratio (PSNR) can also be used as metrics to 
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85626 


65193.007 


125491 


95544.996 


183916 


140027.997 


269542 


205221.004 


395033 


300766.000 


578949 


440793.997 


848491 


646015.002 


1243524 


946781.002 


1822473 


1387574.999 


2670964 


2033590.001 


3914488 


2980371.002 


5736961 


4367946.001 


8407925 


6401536.002 


12322413 


9381907.004 


18059374 


13749853.006 


26467299 


20151389.008 


38789712 


29533296.012 


56849086 


43283149.019 


83316385 


63434538.027 


122106097 


92967834.041 


178955183 


136250983.061 


262271568 


199685521.092 


384377665 


292653355.137 


563332848 


428904338.205 



Table 4: a2 is a +ve Root of a;^ - a;^ - 1 = 0, i.e., a2 « 1.465571 



measure the degree of impcrccptibility: 

M N 



MSE = J2J2(fi:i-9v)VMN 
i=i j=i 



where M and N are the number of rows and number of columns respec- 
tively of the cover image, fij is the pixel value from the cover image, gij 
is the pixel value from the stego-image, and L is the peak signal value of 
the cover image (for 8-bit images, L = 255. In general, for k-bit grayscale 
image, we have L/^ =2*^ — 1). Signal to noise ratio quantifies the imper- 
ceptibility, by regarding the image as the signal and the message as the 
noise. 

Here, we use a slightly different test-statistic, namely, Worst-case-Mean- 
Square-Error (WMSE) and the corresponding PSNR (per pixel) as our test- 
statistics. We define WMSE as follows: 

If the secret data-bit is embedded in the i*'* bitplane of a pixel, the worst-case 
error-square-per-pixel will be = WSE = |T4^(«)(1 — 0)^ — {W{i))'^, correspond- 
ing to when the corresponding bit in cover-image toggles in stego-image, after 
embedding the secret data- bit. For example, worst-case error-square-per-pixel 
for embedding a secret data-bit in the i*^ bit plane in case of a pixel in classical 
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binary decomposition is = (2*)^ — 4% where i £ Ui^}- If the original k-bit 
grayscale cover-image has size w x h, we define, WMSE = w x h x (W{i))'^ = 
wxhx WSE. Here, we try to minimize this WMSE (hence WSE) and maximize 
the corresponding PSNR. We use the results ^ and ( 12 ) to prove our following 
claims: 



5.4.4 The proposed Prime Decomposition generates more (virtual) 
bit-planes 

Using Classical binary decomposition, for a k-bit cover image, we get only k 
bit-planes per pixel, where we can embed our secret data bit. From ([S]) and 



( |12[ ), we get, 

' Pn — 0{n. Inn) 

• 3ap e 5R+ : Fp{n) > (ap)"""^ , ctp > ftp+i , Vp G Z+ , ai w 1.618 

Since n. Inn = o(ckp), it directly implies that p„ = o{Fp{n)). The maximum 
(highest) number that can be represented in n-bit number system using our 
prime decomposition is Y^^=o Pi^ ^^'^ case of n-bit number system using 
Fibonacci p-sequence decomposition is Y^^Zo Ppi''-)- Now, it is easy to prove 
that, 3no G H : Vn > ng we have, Y^^I^ Fp{i) > Yli=o Pi- 

Hence, using same number of bits it is possible to represent more numbers 
in case of the number system using Fibonacci-p-sequence decomposition, than 
that in case of the number system using prime decomposition, when number of 
bits is greater than some threshold. This in turn implies that number of virtual 
bit-planes generated in case of prime decomposition will be eventually (after 
some n) more than the corresponding number of virtual bit-planes generated by 
Fibonacci p-Sequence decomposition. 

From the bar-chart shown in Figure-6, we see, for instance, to represent 
the pixel value 131, prime number system requires at least 12 bits, while for its 
Fibonacci counterpart 10 bits suffice. So, at the time of decomposition the same 
pixel value will generate 12 virtual bit-planes in case of prime decomposition 
and 10 for the later one, thereby increasing the space for embedding. 



5.4.5 Prime Decomposition gives less distortion in higher bit-planes 

Here, we assume the secret message length (in bits) is same as image size, for 
evaluation of our test statistics. For message with different length, the same 
can similarly be derived in a straight-forward manner. 

In case of our Prime Decomposition, WMSE for embedding secret message 
bit only in Z*'' (virtual) bitplane of each pixel (after expressing a pixel in our 
prime number system, using prime decomposition technique) — pf, because 
change in bit plane of a pixel simply implies changing of the pixel value by 
at most Z"* prime number. 

From the above discussion and using equation ([3|, also treating image-size 
as constant we can immediately conclude, (for I > 0) 
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Figure 4: Maximum number that can be represented in different decomposition 
techniques 
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(^^^ ^ l^^bitplane) p^.j^jy^^_ £)^^^^p^g,^^^^j^ ^ ^ h X 9{l -log (0)' (-^*^) 

whereas WMSE in case of classical (traditional) binary (LSB) data hiding 
technique is given by, 

The above result imphes that the distortion in case of prime decomposition 
is much less (since polynomial) than in case of classical binary decomposition 
(in which case it is exponential). 

Now, let us calculate the WMSE for the embedding technique using Fi- 
bonacci p-sequence decomposition. In this case, WMSE for embedding secret 
message bit only in l^^ (virtual) bit-plane of each pixel (after expressing it using 
Fibonacci- 1-sequence decomposition) — {Fp{l)) , because change in plane 
of a pixel simply implies changing of the pixel value by at most l*'^ Fibonacci 
number. 

From inequality (12), we immediately get that in case of p = 1, i.e., for the 
Fibonacci- 1-sequence decomposition, we have. 



{WMSE,. ,U,lane)p^^^^^^_,_Se,uence Decom,os.Uon = (^(0)' = ^ ((2.618)') 

Similarly, for other values of p, one can easily derive (by induction) some ex- 
ponential lower-bounds, which are definitely better than the exponential bound 
obtained in case of classical binary decomposition, but still they are exponential 
in nature, even if the base of the exponential lower bound will decrease gradually 
with increasing p. So, we can generalize the above result by the following, 

aj, e JH^, ai = ^ , 

The sequence is decreasing in p . Obviously, Fibonacci-p-sequence decompo- 
sition, despite being better than classical binary decomposition, is still exponen- 
tial and causes much-more distortion in the higher bit-planes, than our prime 
decomposition, in which case WMSE is polynomial (and not exponential!) in 
nature. The plot shown in Figure-5 proves our claim, it vindicates the polyno- 
mial nature of the weight function in case of prime decomposition against the 
exponential nature of classical binary and Fibonacci decomposition. 

So from all above discussion, we conclude that Prime Decomposition gives 
less distortion than its competitors (namely classical binary and Fibonacci De- 
composition) while embedding secret message in higher bit-planes. 

At a glance, results obtained for test-statistic WMSE, for our k-bit cover 
image. 
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Figure 5: Weight functions for different decomposition techniques 



^^VMSEith bitplane) Qifj_ggj^f.fj_i Binary Decomposition ^(^^ )• 



' Classical Binary Decomposition 
^ Prime Decomposition 



{WMSE,. Ut,Une),^,^^ Oecorr^osUion = ^^il' -WH)). 



{^^V S E j^th bitplane) pj_^QjiQ^(.Q2,—p Decomposition ^ (('^p) ) ' 

ap e 2.618 > > ap+i,Vp e Z+, with 

Fibonacci— 1 Decomposition ~ ^ ((2.618) ) (-^^) 

Also, results for our test-statistic PSNRyjorst, 

(2^= - 1)2 



({PSNR^orstU,uplane)„. „ ,, = lO.^O.g 

V ^ / Binary Decomposition 

({PSNR^orst)l^H,Uplane)^., . „ = lO-^O^lO ( 

\ ^ / Fibonacci— p Decomposition \ 



(2^ - If ' 

ap e 2.618 > ap> ap+i,\/p e Z+, mt/i 



^^^^^^^^ Fibonacci— 1 Decomposition ^ (2.618)^ ^ ^ ^ 

6 Experimental Results for data-hiding technique 
using Prime decomposition 

We have, as input: 
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• Cover Image: 8-bit (256 color) gray-level standard image of Lena. 

• Secret message length = cover image size, (message string "sandipan" 
repeated multiple times to fill the cover image size). 

• The secret message bits are embedded into one (selected) bit-plane per 
pixel only, the bitplane is indicated by the variable p . 

• The test message is hidden into the chosen bitplane using different de- 
composition techniques, namely, the classical (traditional) binary (LSB) 
decomposition, Fibonacci 1-sequence decomposition and Prime decompo- 
sition separately and compared. 

We get, as output: 

• As was obvious from the above theoretical discussions, our experiment 
supported the fact that was proved mathematically. 

• As obvious, as the relative entropy between the cover-image and the stego- 
image tends to be more and more positive (i.e., increases), we get more 
and more visible distortions in image rather than invisible watermark. 

• Figure-6 shows gray level [0 . . . 255] vs. frequency plot of the cover image 
and stego image in case of classical LSB data-hiding technique. As seen 
from the figure, we get only 8 bit-planes and the frequency distribution 
(as shown in histograms) and hence the probability mass function [2 7) 
corresponding to gray-level values changes abruptly, resulting in an in- 
creasing relative entropy between cover-image and stego-image, implying 
visible distortions, as we move towards higher bit-planes for embedding 
data bits. 

• The next figure (Figure-7) shows gray level [0 . . . 255] vs. frequency plot of 
the cover image and stego image in case of data-hiding technique based on 
Fibonacci decomposition. This figure shows that, we get 12 bit-planes and 
the probability mass function corresponding to gray-level values changes 
less abruptly, resulting in a much less relative entropy between cover-image 
and stego-image, implying less visible distortions, as we move towards 
higher bit-planes for embedding data bits. 

• The next figure (Figure-8) again shows gray level [0 . . . 255] vs. frequency 
plot of the cover image and stego image in case of data-hiding technique 
based on Prime decomposition. This figure shows that, we get 15 bit- 
planes and the change of frequency distribution (and hence probability 
mass function) corresponding to gray-level values is least when compared 
to the other two techniques, eventually resulting in a still less relative 
entropy between the cover image and stego-image, implying least visible 
distortions, as we move towards higher bitplanes for embedding data bits. 
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stego imas^ afier tuiasis- sec-iiel d£tta biz in iJeiifftneni bit phones in £^3 tsrrn.mijiie 



Figure 6: Frequency distribution of pixel gray-levels in different bit-planes before 
and after data-hiding in case of classical LSB technique 

• Data-hiding technique using the prime decomposition has a better perfor- 
mance than that of Fibonacci decomposition, the later being more efhcient 
than classical binary decomposition, when judged in terms of embedding 
secret data bit into higher bit-planes causing least distortion and thereby 
having least chance of being detected, since one of principal ends of data- 
hiding is to go as long as possible without being detected. 

• Using classical binary decomposition, we get here only 8 bit planes (since 
an 8-bit image), using Fibonacci 1-sequence decomposition we have 12 
(virtual) bit-planes, and using prime decomposition we have still higher, 
namely 15 (virtual) bit-planes. 

• As vindicated in Figure-9, distortion is highest in case of classical binary 
decomposition, less prominent in case of Fibonacci, and least for prime. 

This technique can be enhanced by embedding into more than one (virtual) 
bit-plane, following the variable-depth data-hiding technique [5T]. 

7 Proposed approach 2 : The Natural Number 
Decomposition Technique 

For further improvement in the same line, we introduce a new number system 
and use transformation into that in order to get more (virtual) bit-planes, and 
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Figure 7: Frequency distribution of pixel gray-levels in different bit-planes before 
and after data- hiding in case of Fibonacci (1-sequence) decomposition technique 
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Figure 8: Frequency distribution of pixel gray-levels in different bit-planes before 
and after data-hiding in case of Prime decomposition technique 
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also to have better image quality after embedding data into higher (virtual) 
bit-planes. 

7.1 The Proposed Decomposition in Natural Numbers 

We define yet another new number system, and as before we denote it as 
(2, A^(.)), where the weight function N{.) is defined as, W{i) = N{i) = Vi e 
Z+[j{0} 

Since the weight function here is composed of natural numbers, we name 
this number system as natural number system and the decomposition as natural 
number decomposition. 

This technique also involves a lot of redundancy. Proving this is again very 
easy by using pigeonhole principle. Using n bits, we can have 2" different binary 
combinations. But, as is obvious and we shall prove shortly that using n bits, 
all (and only) the numbers in the range [0, n(n + l)/2], i.e., total "^"^^"^^ + 1 
different numbers can be represented using our natural number decomposition. 
Since by induction one can easily show, 2" > j^i^ Vn > 2, n G H, we con- 

clude, by Pigeon hole principle that, at least 2 representations out of 2" binary 
representations will represent the same value. Hence, we have redundancy. 

As we need to make our transform one-to-one, what we do is exactly the 
same that we did in case of prime decomposition: if a number has more than 
one representation in our number system, we always take the lexicographically 
highest of them, (e.g., the number 3 has 2 different representations in 3-bit 
natural number system, namely, 100 and Oil, since we have, 1.3 -|- 0.2 -|- 0.1 = 3 
and 0.3 + 1.2 + 1.1 = 3. But, since 100 is lexicographically (from left to right) 
higher than Oil, we choose 100 to be valid representation for 3 in our natural 
number system and thus discard Oil, which is no longer a valid representation 
in our number system. 3 = max , . , . (100,011) = 100 So, in our 3-bit 

iextcograph'tc v ' / ' 

example, the valid representations are: 000 O 0,001 O 1,010 O 2,100 O 
3, 101 -H- 4, 110 -H- 5, 111 -H- 6 Also, to avoid loss of message, we embed secret 
data bit to only those pixels, where, after embedding we get a valid represen- 
tation in the number system. It is worth noticing that, up-to 3- bits, the prime 
number system and the natural number system are identical, after that they are 
different. 

7.2 Embedding algorithm 

• First, we need to find a number n G H such that all possible pixel values 
in the range [0, 2^ — 1] can be represented using first n natural numbers in 
our n-bit prime number system, so that we get n virtual bit-planes after 
decomposition. To find the n is quite easy, since we see, and we shall prove 
shortly that, in n-bit Natural Number System, all the numbers in the range 
[0,ri(n-|-l)/2] can be represented. So, our job reduces to finding an n such 
that "^"2^^^ > 2*^ — 1, i.e., solving the following quadratic in-equality 

+n- 2*=+^ +2>Q, 
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=> n > 



-l + \/2'=+3 + 9 



neZ+ 



(17) 
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• After finding n, we create a map of k-bit (classical binary decomposition) 
to n-bit numbers (natural number decomposition), n > k , marking all 
the valid representations (as disciisscd in previous section) in our natural 
number system. For an 8-bit image the set of all possible pixel-values in 
the range [0, 255] has the corresponding natural number decomposition as 
shown in Table-5. 

For fc = 8, we get, 



Hence, for an 8-bit image, we get 23 (virtual) bit-planes. 

If we recapitulate our earlier result, as we see from the map shown in Table- 
2, in case of prime decomposition, it yields much less numbers of (virtual) 
bit planes (namely 15). Again it is noteworthy that the space to store the 
map is still increased. Although this computation of the map (one-time 
computation for a fixed value of k) is slightly more expensive and takes 
more space to store in case of our natural number decomposition than 
in case of prime decomposition, the first outperforms the later one when 
compared in terms of steganographic efficiency, i.e., in terms of embedded 
image quality, security (since number of virtual bit-planes will be more in 
case of the first) etc, as will be explained shortly. 

• Next, for each pixel of the cover image, wc choose a (virtual) bit plane, 
say p*'' bit-plane and embed the secret data bit into that particular bit 
plane, by replacing the corresponding bit by the data bit, if and only if 
we find that after embedding the data bit, the resulting sequence is a 
valid representation in n-bit prime number system, i.e., exists in the map 
otherwise discard that particular pixel for data hiding. 

• After embedding the secret message bit, we convert the resultant sequence 

in prime number system back to its value (in classical 8-4-2-1 binary num- 
ber system) and we get our stego-image. This reverse conversion is easy, 
since we need to calculate J2^=o + 1) only, &, e {0, 1}, Vi e {0, n— 1}. 

7.3 Extracting algorithm 

The extraction algorithm is exactly the reverse. From the stego-image, we 
convert each pixel with embedded data bit to its corresponding natural decom- 
position and from the p*'* bit-plane extract the secret message bit. Combine 
all the bits to get the secret message. Since, for efficient implementation, we 
shall have a hash-map for this conversion, the bit extraction is constant-time, 
so the secret message extraction will be polynomial (linear) in the length of the 
message embedded. 



n > 



-1 + V28+3 + 9 _ -1 + \/2057 _ 44.35 
2 ~ 2 ~ 2 



= 22.675 ^ n = 23 
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N 


Natural Dccomp 


N 


Natural Dccomp 





00000000000000000000000 


64 


11001000000000000000000 


1 


00000000000000000000001 


65 


11010000000000000000000 


2 


00000000000000000000010 


66 


11100000000000000000000 


3 


00000000000000000000100 


67 


11100000000000000000001 


4 


00000000000000000001000 


68 


11100000000000000000010 


5 


00000000000000000010000 


69 


11100000000000000000100 


6 


00000000000000000100000 


70 


1 1 100000000000000001000 


7 


00000000000000001000000 


71 


11100000000000000010000 


8 


00000000000000010000000 


72 


1 1 100000000000000100000 


9 


00000000000000100000000 


73 


11100000000000001000000 


10 


oooooooonononionononono 


74 


1 1 1 n n n n n n 1 n n 


11 


UDUOOOOOOUDUIUOOOOODUDU 


TT) 


1111)UI)U()U(I0()0()1UI)UIK)()00 


12 


00000000000100000000000 


76 


11100000000001000000000 


13 


00000000001000000000000 


77 


11100000000010000000000 


14 


00000000010000000000000 


78 


11100000000100000000000 


15 


00000000100000000000000 


79 


11100000001000000000000 


16 


00000001000000000000000 


80 


11100000010000000000000 


17 


00000010000000000000000 


81 


11100000100000000000000 


18 


00000100000000000000000 


82 


11100001000000000000000 


19 


00001000000000000000000 


83 


11100010000000000000000 


20 


00010000000000000000000 


84 


11100100000000000000000 


21 


00100000000000000000000 


85 


1 1 101000000000000000000 


22 


01000000000000000000000 


86 


1 1 1 10000000000000000000 


23 


10000000000000000000000 


87 


1 1 1 10000000000000000001 


24 


10000000000000000000001 


88 


11110000000000000000010 


25 


10000000000000000000010 


89 


1 1 1 10000000000000000100 


26 


1 0000000000000000000100 


90 


11110000000000000001000 


27 


10000000000000000001000 


91 


1 1 1 10000000000000010000 


28 


10000000000000000010000 


92 


11110000000000000100000 


29 


10000000000000000100000 


93 


11110000000000001000000 


30 


10000000000000001000000 


94 


11110000000000010000000 


31 


10000000000000010000000 


95 


11110000000000100000000 


32 


10000000000000100000000 


96 


11110000000001000000000 


33 


10000000000001000000000 


97 


11110000000010000000000 


34 


10000000000010000000000 


98 


11110000000100000000000 


35 


10000000000100000000000 


99 


11110000001000000000000 


36 


10000000001000000000000 


100 


1 1 1 10000010000000000000 


37 


10000000010000000000000 


101 


1 1 1 10000100000000000000 


38 


10000000100000000000000 


102 


11110001000000000000000 


39 


10000001000000000000000 


103 


1 1 1 10010000000000000000 


40 


10000010000000000000000 


104 


11110100000000000000000 


41 


10000100000000000000000 


105 


1111 1000000000000000000 


42 


10001000000000000000000 


106 


11111000000000000000001 


43 


10010000000000000000000 


107 


11111000000000000000010 


44 


10100000000000000000000 


108 


11111000000000000000100 


45 


11000000000000000000000 


109 


11111000000000000001000 


46 


11000000000000000000001 


110 


11111000000000000010000 


47 


11000000000000000000010 


111 


11111000000000000100000 


48 


1 1000000000000000000100 


112 


111 1 1000000000001000000 


49 


1 1000000000000000001000 


113 


1111 1000000000010000000 


50 


1 1000000000000000010000 


114 


1111 1000000000100000000 


51 


1 1000000000000000100000 


115 


1111 1000000001000000000 


52 


1 1000000000000001000000 


116 


1111 1000000010000000000 


53 


1 1000000000000010000000 


117 


1111 1000000100000000000 


54 


1 1000000000000100000000 


118 


1111 1000001000000000000 


55 


11000000000001000000000 


119 


111 1 1000010000000000000 


56 


11000000000010000000000 


120 


11111000100000000000000 


57 


11000000000100000000000 


121 


11111001000000000000000 


58 


11000000001000000000000 


122 


111 1 1010000000000000000 


59 


11000000010000000000000 


123 


11111 100000000000000000 


60 


11000000100000000000000 


124 


11111100000000000000001 


61 


11000001000000000000000 


125 


11111100000000000000010 


62 


11000010000000000000000 


126 


11111100000000000000100 


63 


11000100000000000000000 


127 


11111100000000000001000 



Table 5: Natural Number decomposition yielding 23 virtual bit-planes 
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7.4 The performance analysis : Comparison between Prime 
Decomposition and Natural Number Decomposition 

In this section, we do a comparative study between the different decompositions 
and its effect upon higher-bit-plane data-hiding. We basically try to prove our 
following claims, 

7.4.1 In k-bit Natural Number System, all the numbers in the range 
[0,k{k + l)/2] can be represented and only these numbers can 
be represented 

Proof by Induction on k: 

Basis; fc = 1, we can represent only 2 numbers, namely and 1, but we 
have, ^^^2*"^^ — 1' the numbers (and only these numbers) in the range 

[0, 1], i.e., [0, M^±i)] can be represented for fc = 1. 

Induction hypothesis: Let us assume the above result holds Vfc < n, n G H. 

Now, let us prove the same for fc = n + 1. 

From induction hypothesis, we know, using n bit Natural Number System, 
all (and only) the numbers in the range [0, !i(l^ti)] can be represented. Let us 



list all the valid representations in n bit, 

= bo,n-lbo,n-2 ■ ■ ■ 6o,i&o,o = 0000 ... 00 

1 = 6i,„_i6i,„_2 . . . 6i,i6i,o = 0000 ... 01 



Now, for (n -|- 1) bit Natural Number System, we have the weight corre- 
sponding to the n*'* significant Bit (MSB), W{n) = n+1. 

So when the MSB is 0, we have all the numbers in the range [0, ^i^^tll] 



n(n -I- l)/2 = bn(n+l)/2,n-lK{n+l)/2.n-2 ■ ■ ■ 



bn(n+l) /2,lbn{n+l) /2fi 



1111 



11 



Obo,n-lbo,n-2 
06i,„_l6l,„_2 



06„(„+i)/2,„_l 

and when the MSB is 1, we get a new set of 



njn+l) 
2 



+ 1 numbers 



n + 1 + 0, 
n + l + l, 
n + 1 + 2, 



n+l + 



n{n + 1) 



2 
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i.e., all the (consecutive) numbers in the range + 1, ("+iK"+^) j 

1^0,n-1^0,n-2 
l&l,n-lfel,n-2 



l^n(n+l)/2,n-l 

So, we get all the numbers in the range [0, "("+^) ] (JJy^ -|- ("+iK"+2) j _ 
[0, ("+iK"+^) ] 

Also, the maximum number that can be represented (all I's) using (n + 1) 
bit Natural Number System. = (n + 1) + (n) + (n - 1) + . . . + (3) + (2) + (1) = 
(Ti+iXrt+2) ^ minimum number that can be represented (all O's) is 0. Hence, 
only the numbers in this range can be represented. 

Hence, we proved for A: = n + 1 also. ^ V/c £ H the above result holds. 



7.4.2 The proposed Natural Number Decomposition generates more 
(virtual) bit-planes 

Using Classical binary decomposition, for a k-bit cover image, we get only k bit- 
planes per pixel, where we can embed our secret data bit. From equation ([s]), we 
get, Pn — (n.ln{n)) Since n + 1 = o (n.ln{n)), the weight corresponding to the 
n*^ bit in our number system using natural number decomposition eventually 
becomes much higher than the weight corresponding to the n*^ bit in the number 
system using prime decomposition. In n-bit Prime Number System, the numbers 
in the range [0, X]"=o^ P-i\ represented, while in our n-bit Natural Number 

System, the numbers in the range [0,X]r=o^ + 1)1 = [0'Sr=i = "^"2^^^ ] 
can be represented. Now, it is easy to prove that 3no e H : Vn > uq , we have, 

^i=n-l n{n+l) 
Z^i=0 ^ 2 ■ 

Hence, using same number of bits, it is eventually possible to represent more 
numbers in case of the number system using prime decomposition, than that in 
case of the number system using natural number decomposition. This in turn 
implies that number of virtual bit-planes generated in case of natural number 
decomposition will be eventually more than the corresponding number of virtual 
bit-planes generated by prime decomposition. 

From The bar-chart shown in Figure-10, we see that, in order to represent 
the pixel value 92, Natural number system requires at least 14 bits, while for 
Prime number system 10 bits sufhce. So, at the time of decomposition the 
same pixel value will generate 14 virtual bit-planes in case of natural number 
decomposition and 10 for the prime, thereby increasing the space for embedding. 

7.4.3 Natural Number Decomposition gives less distortion in higher 
bit-planes 

Here we assume the secret message length (in bits) is same as image size, for 
evaluation of our test statistics. For message with different length, the same 
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Figure 10: Maximum number that can be represented in prime and natural 
number decomposition techniques 



33 



i 
























1 























Figure 11: Weight functions for different decomposition techniques 

can similarly be derived in a straight-forward manner. 

In case of Prime Decomposition technique, WMSE for embedding secret 
message bit only in l*'^ (virtual) bitplane of each pixel (after expressing a pixel in 
our prime number system, using prime decomposition technique) = pf, because 
change in l*^ bit plane of a pixel simply implies changing of the pixel value by 
at most the prime number. From above, (treating image-size as constant) 
we can immediately conclude, from equation ([3|, for ^ > 

(WMSEith bitvlane) u n ^ W X h X pf ^ 9(f.log^(l)) 

\ L uLLpiii/LL. / Uecomposition ^ ^ \ 1 1 

In case of our Natural Decomposition, WMSE for embedding secret message 
bit only in l^^ (virtual) bit-plane of each pixel (after expressing a pixel in our 
natural number system, using natural number decomposition technique) — 
1)^. From above, (treating image-size as constant again) we can immediately 
conclude, 

{^^1^ lS/1 S E ith liitplane) ^f^fy^^f^i dumber Decomposition ^ ^' 

Since {I + 1)^ = o{P .log^ (l)) , eventually we have, 

(\V^ S E [th hitplajie) j^Q^Qjj^pQgj^f^Qj^ ^ (^WAISEith bitplane) p^j^^^ j^^^Q^pQ^j^^j^Qj^ 

The above result implies that the distortion in case of natural number de- 
composition is much less than that in case of prime decomposition. The plot 
shown in Figure- 11 buttresses our claim, it compares the nature of the weight 
function in case of prime decomposition against that of the natural number 
decomposition. 

So, from all above discussion, we conclude that Natural Number Decomposi- 
tion gives less distortion than Prime Decomposition technique, while embedding 
secret message in higher bit-planes. 

At a glance, results obtained for test-statistic WMSE, in case of our k-bit 
cover image. 
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(^^V AI S E ith l)itpla7ie) (jio^gg^^Q^i Binary Decomposition ^("^ )■ 
{^^^K^SEith hitplane) Pj.,^^^ QQ^Q^y^pQ^,^j^j^Q,j.-i^ ^(^ .loQ (0)' 

Also, results for our test-statistic PSNRu,orst, 

•(2^--i;- 



UPSNR^orst)i^HkUplan^^. . „ = lO-WiQ ( 

\ / C tassical Binary Decomposition \ \^ ) 

f \ ( {2^ - 



\ / Frime D ecomposition \ C.L .iUy yi j ^ 

{iPSNRi^orstUutplane)^ ^ = lO-^io (^^— ^) • (19) 

\ / JVaturat Jy umber Uecomposition \ -\- 1 J I 



From equations ( 18 ) and ( 19 ), we see that, WMSE gradually decreased from 
Binary to Prime and then from Prime to Natural decomposition techniques 
(minimized in case of Natural number decomposition) , ensuring lesser probabil- 
ity of distortion, while PSNR gradually increased along the same direction (max- 
imized in case of Natural number decomposition), implying more impercibility 
in message hiding. 

7.4.4 Natural Number Decomposition is Optimal 

This particular decomposition technique is optimal in the sense that it generates 
maximum number of (virtual) bit-planes and also least distortion while embed- 
ding in higher bit-planes, when the weight function is strictly monotonically in- 
creasing. Since, among all monotonic strictly increasing sequences of positive in- 
tegers, natural number sequence is the tightest, all others are subsequences of the 
natural number sequence. Our generalized model indicates that the optimality 
of our technique depends on which number system we choose, or more precisely, 
which weight function we define. Since weight function W : Z+ 1J{0} — > Z+ 
(Since we are going to represent pixel- values, that are nothing but non-negative 
integers, the co-domain of our weight function is set of non-negative integers. 
Also, weight function is assumed to be one-one, otherwise there will be too much 
redundancy) is optimized when it is defined as W(i) = i + 1, Vi G U{0}: 
i.e., in case of natural number decomposition. 

Since we have, the weight function W : U{0} ^ -^^i that assigns a 
bit-plane (index) an integral weight, if we assume that weight corresponding 
to a bit-plane is unique and the weight is monotonically increasing, one of the 
simplest but yet optimal way to construct such an weight function is to assign 
consecutive natural number values to the weights corresponding to each bit- 
plane, i.e., W{i) = i l,Vi G 2'+ |J{0} (We defined W(i) = i -h 1 instead of 
W(z) — i, since we want all-zero representation for the value 0, in this particular 
number system). Now, this particular decomposition in virtual bit-planes and 
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embedding technique gives us optimal result. We get optimal performance of 
any data-hiding technique by minimizing our test-statistic WMSE. For embed- 
ding data in Z*'' virtual bit-plane, we have {WMSE)ith utpiane = so 
minimizing WMSE implies minimizing the weight function W{.) , but having our 
weight function allowed to assume integral values only, and also assuming the 
values assigned by W are unique {W is injective, we discard the un-interesting 
case when weight-values corresponding to more than one bit-planes are equal), 
we can without loss of generality assume W to be monotonically increasing 
But, according to the above condition imposed on W, we see that such strictly 
increasing W assigning minimum integral weight-values to different bit planes 
must be linear in bit-plane index. 

Put it in another way, for n-bit number system, we need n different weights 
that are to be assigned to weight-values corresponding to n bit-planes. But, the 
assigning must also guarantee that these weight values are minimum possible. 
Such n different positive integral values must be smallest n consecutive natu- 
ral numbers, i.e., 1,2,3, ...,n. But, our weight function W{i) = i + 1, Vi S 
Z"*" 1J{0} merely gives these values as weights only, hence this technique is op- 
timal. 

Using classical binary decomposition, we get k bit planes only corresponding 
to a k-bit image pixel value, but in case of natural number decomposition, we 
get, n-bit pixels, where n satisfies. 



n > 



n'^ +n- 2''+^ + 2 > 
-1 + ^/2*^+^^ + 9 
2 ' 



(20) 



8 Experimental Results for Natural Number de- 
composition technique 

We have, again, as input: 

• Cover Image: 8-bit (256 color) gray-level standard image of Lena. 

• Secret message length — cover image size, (message string "sandipan" 
repeated multiple times to fill the cover image size). 

• The secret message bits are embedded into one (selected) bit-plane per 
pixel only, the bitplane is indicated by the variable p. 

• The test message is hidden ([26]) into the chosen bit-plane using different 
decomposition techniques, namely, the classical (traditional) binary (LSB) 
decomposition, Fibonacci 1-sequence decomposition and Prime decompo- 
sition separately and compared. 
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get, as output: 



As was obvious from the above theoretical discussions, our experiment 
supported the fact that was proved mathematically, i.e., we got more 
(virtual) bit-planes and less distortion after embedding secret message 
into the bit-planes in case of Natural and Prime decomposition technique 
than in case of Fibonacci technique and classical binary LSB data hiding 
technique. We could also capture the hidden message from the stego-image 
successfully useing our decoding technique. 

As obvious, as the relative entropy between the cover-image and the stego- 
image tends to be more and more positive (i.e.. increases), we get more 
and more visible distortions in image rather than invisible watermark. 

As recapitulation of our earlier experimental result, Figure-8 shows gray 
level (0 . . . 255) vs. frequency plot of the cover image and stego-image in 
case of data-hiding technique based on Prime decomposition. This figure 
shows that, we get 15 bit-planes and the change of frequency distribution 
(and hence probability mass function) corresponding to graylevel values 
is least when compared to the other two techniques, eventually resulting 
in a still less relative entropy between the cover-image and stego-image, 
implying least visible distortions, as we move towards higher bit-planes 
for embedding data bits. 

Figure-12 shows gray level (0 . . . 255) vs. frequency plot of the cover im- 
age and stego image in case of data-hiding technique based on Natural 
Number decomposition. We get 23 bit-planes and the change of frequency 
distribution (and hence probability mass function) corresponding to gray- 
level values is least when compared to the other two techniques, eventu- 
ally resulting in a still less relative entropy between the cover-image and 
stego-image, implying least visible distortions, as we move towards higher 
bit-planes for embedding data bits. 

Data-hiding technique using the natural number decomposition has a bet- 
ter performance than that of prime decomposition, the later being more 
efficient than classical binary decomposition, when judged in terms of em- 
bedding secret data bit into higher bit-planes causing least distortion and 
thereby having least chance of being detected, since one of principle ends 
of data-hiding is to go as long as possible without being detected. 

Using classical binary decomposition, we get here only 8 bit planes (since 

an 8 bit image), using Fibonacci 1-sequence decomposition we have 12 
(virtual) bit-planes, and using prime decomposition we have 15 (virtual) 
bit-planes, but using natural decomposition, we have the highest, namely, 
23 (virtual) bit planes. 

As vindicated in the figures 8 and 9, distortion is much less for natural 
decomposition, than that in case of prime. This technique can also be 
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i'iegc image with hidden data sit in different bit planes (p) 



Figure 12: Result of embedding secret data in different bit-planes using Natural 
Number decomposition technique 
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Figure 13: Comparison of WMSE values for different data hiding techniques 



enhanced by embedding into more than one (virtual) bit-plane, following 
the variable-depth data- hiding technique fS] . 

• Figures 13 and 14 show comparison of WMSE and PSNR values, respec- 
tively, obtained from experimental results. It clearly shows that even for 
higher bitplanes the secret data can be reliably hidden with quite high 
PSNR value. Hence, it will be difficult for the attacker to predict the 
secret embedding bitplane. 

• The cxpcrmental results were obtained by implementing the algorithms 
and data hiding techniques in C++ (open source gcc) and (gray-scale) 
Lena bitmap as input image file. Also the extraction algorithms that 
described for both the techniques run at linear time in length of message 
embeded. 



9 Conclusions 

This chapter presented very simple methods of data hiding technique using 
prime numbers / natural numbers. It is shown (both theoretically and exper- 
imentally) that the data-hiding technique using prime decomposition outper- 
forms the famous LSB data hiding technique using classical binary decomposi- 
tion and that using Fibonacci p-sequence decomposition. Also, the technique 
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PSNR Values Tor different data hiding techniques 



60.00 




Figure 14: Comparison of PSNR values for different data hiding techniques 



using natural number decomposition outperforms the one using prime decom- 
position, when thought with respect to embedding secret data bits at higher 
bit-planes (since number of virtual bit-planes generated also increases) with less 
detectable distortion. Wc have shown all our experimental results using the 
famous Lena image, but since in all our theoretical derivation above we have 
shown our test-statistic value (WMSE, PSNR) independent of the probability 
mass function of the gray levels of the input image, the (worst-case) result will 
be similar if we use any gray-level image as input, instead of the Lena image. 
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